Search CORE

105 research outputs found

Estimating articulatory parameters from the acoustic speech signal

Author: Richmond Korin
Publication venue: The University of Edinburgh
Publication date: 01/01/2002
Field of study

Edinburgh Research Archive

Lip Synchronization by Acoustic Inversion

Author: Berger Michael
Hofer Gregor
Richmond Korin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

Crossref

Edinburgh Research Explorer

On Generating Combilex Pronunciations via Morphological Analysis

Author: Clark Robert
Fitt Sue
Richmond Korin
Publication venue
Publication date: 01/09/2010
Field of study

Combilex is a high-quality lexicon that has been developed specifically for speech technology purposes and recently released by CSTR. Combilex benefits from many advanced features. This paper explores one of these: the ability to generate fully-specified transcriptions for morphologically derived words automatically. This functionality was originally implemented to encode the pronunciations of derived words in terms of their constituent morphemes, thus accelerating lexicon development and ensuring a high level of consistency. In this paper, we propose this method of modelling pronunciations can be exploited further by combining it with a morphological parser, thus yielding a method to generate full transcriptions for unknown derived words. Not only could this accelerate adding new derived words to Combilex, but it could also serve as an alternative to conventional letter-to-sound rules. This paper presents preliminary work indicating this is a promising direction

Edinburgh Research Explorer

Edinburgh Research Archive

Feature-space transform tying in unified acoustic-articulatory modelling of articulatory control of HMM-based speech synthesis

Author: Ling Zhen-Hua
Richmond Korin
Yamagishi Junichi
Publication venue
Publication date: 01/08/2011
Field of study

Edinburgh Research Explorer

Comparison of HMM and TMD Methods for Lip Synchronisation

Author: Hofer Gregor
Richmond Korin
Publication venue
Publication date: 24/09/2010
Field of study

This paper presents a comparison between a hidden Markov model (HMM) based method and a novel artificial neural network (ANN) based method for lip synchronisation. Both model types were trained on motion tracking data, and a perceptual evaluation was carried out comparing the output of the models, both to each other and to the original tracked data. It was found that the ANN-based method was judged significantly better than the HMM based method. Furthermore, the original data was not judged significantly better than the output of the ANN method

Edinburgh Research Archive

Enhancing Sequence-to-Sequence Text-to-Speech with Morphology

Author: Richmond Korin
Taylor Jason
Publication venue: 'International Speech Communication Association'
Publication date: 25/10/2020
Field of study

Crossref

Edinburgh Research Explorer

Confidence Intervals for ASR-based TTS Evaluation

Author: Richmond Korin
Taylor Jason
Publication venue: 'International Speech Communication Association'
Publication date: 03/09/2021
Field of study

Edinburgh Research Explorer

Generating gestural timing from EMA data using articulatory resynthesis

Author: Richmond Korin
Steiner I.
Publication venue
Publication date: 01/01/2008
Field of study

As part of ongoing work to integrate an articulatory synthesizer into a modular TTS platform, a method is presented which allows gestural timings to be generated automatically from EMA data. Further work is outlined which will adapt the vocal tract model and phoneset to English using new articulatory data, and use statistical trajectory models

Edinburgh Research Archive

Edinburgh Research Explorer

Comparison of HMM and TMDN Methods for Lip Synchronisation

Author: Hofer Gregor
Richmond Korin
Publication venue
Publication date: 01/09/2010
Field of study

This paper presents a comparison between a hidden Markov model (HMM) based method and a novel artificial neural network (ANN) based method for lip synchronisation. Both model types were trained on motion tracking data and a perceptual evaluation was carried out comparing the output of the models, both to each other and to the original tracked data. It was found that the ANN based method was judged significantly better than the HMM based method. Furthermore the original data was not judged significantly better than the output of the ANN method. Index Terms: hidden Markov model, mixture density network, lip synchronisation, inversion mappin

CiteSeerX

Edinburgh Research Explorer

Redundancy and productivity in the speech technology lexicon - can we do better?

Author: Fitt Sue
Richmond Korin
Publication venue
Publication date: 01/09/2006
Field of study

Edinburgh Research Explorer